
test: re-enable sql_hive-1 for Spark 4.0 and fix two small failures #4047

Merged
andygrove merged 4 commits into apache:main from andygrove:test-hive1-spark4-retry
Apr 24, 2026

Conversation

@andygrove
Member

@andygrove andygrove commented Apr 23, 2026

Which issue does this PR close?

Closes #2946.
Closes #4049.

Rationale for this change

The sql_hive-1 job for Spark 4.0 has been excluded from CI since #2946 was filed (hang/timeout on the Hive suite with JDK 17). A re-run on current main shows the hang no longer reproduces: the job completes in about 63 minutes. It does surface two small issues, which this PR also fixes, so the job can be re-enabled and kept green.

What changes are included in this PR?

  1. .github/workflows/spark_sql_test.yml: remove the exclude matrix entry that skipped sql_hive-1 for spark-4.0.1 / java 17 / auto.
  2. spark/src/main/spark-4.0/org/apache/spark/sql/comet/shims/ShimSparkErrorConverter.scala: the FileNotFound case was producing a SparkFileNotFoundException with error class _LEGACY_ERROR_TEMP_2055, which was removed in Spark 4.0. Delegate to QueryExecutionErrors.fileNotExistError(path, cause) so the error carries FAILED_READ_FILE.FILE_NOT_EXIST, which is what HiveMetadataCacheSuite (and similar checkError assertions) expect.
  3. dev/diffs/4.0.1.diff: drop the hunk that commented out assume(new java.io.File(jarPath).exists) in HiveUDFDynamicLoadSuite. Spark 4.0 no longer ships hive-test-udfs.jar, so restoring the upstream assume cancels the five UDF tests cleanly instead of running them without the required classes and failing with CANNOT_LOAD_FUNCTION_CLASS.
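The shim change in item 2 can be sketched roughly as follows. This is an illustrative Scala sketch, not the actual file contents: the method name `convertFileNotFound` and its surrounding structure are assumptions; only `QueryExecutionErrors.fileNotExistError(path, cause)` and the error classes are taken from the PR description.

```scala
import java.io.FileNotFoundException

import org.apache.spark.sql.errors.QueryExecutionErrors

object ShimSparkErrorConverterSketch {
  // Before (Spark 3.x style): build a SparkFileNotFoundException carrying
  // the legacy error class. That class was removed from Spark 4.0's
  // error-classes registry, so throwing it triggers an internal
  // "Cannot find main error class" error instead of the intended message:
  //
  //   new SparkFileNotFoundException(
  //     errorClass = "_LEGACY_ERROR_TEMP_2055",
  //     messageParameters = Map("message" -> s"File $path does not exist"))
  //
  // After: delegate to QueryExecutionErrors so the thrown exception carries
  // FAILED_READ_FILE.FILE_NOT_EXIST with the `path` parameter that
  // checkError assertions in suites like HiveMetadataCacheSuite expect.
  def convertFileNotFound(path: String): Throwable =
    QueryExecutionErrors.fileNotExistError(
      path,
      new FileNotFoundException(s"File $path does not exist"))
}
```

The key point is that the shim no longer hard-codes an error class at all; it lets Spark's own error factory pick the current one, so the shim stays correct if the class is renamed again upstream.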

How are these changes tested?

The CI run on this PR exercises the full sql_hive-1 job for Spark 4.0. The previously failing suites behave as follows:

  • HiveMetadataCacheSuite (4 failures): the four tests now pass with the corrected error class.
  • HiveUDFDynamicLoadSuite (5 failures): the five tests are now cancelled on Spark 4.0 (jar absent) and still run on Spark 3.4/3.5 (jar present).

The Spark 4.0 `ShimSparkErrorConverter` was converting native FileNotFound
errors into a `SparkFileNotFoundException` with error class
`_LEGACY_ERROR_TEMP_2055`, but that error class was removed in Spark 4.0.
Throwing it triggers an internal error ("Cannot find main error class")
and fails tests such as `HiveMetadataCacheSuite` that assert on
`FAILED_READ_FILE.FILE_NOT_EXIST`.

Delegate to `QueryExecutionErrors.fileNotExistError`, which is the 4.0
replacement for `readCurrentFileNotFoundError` and produces the expected
error class and `path` parameter.

The 4.0.1 Spark diff commented out the `assume(new java.io.File(jarPath).exists)`
guard in HiveUDFDynamicLoadSuite. The jar (hive-test-udfs.jar) is not
shipped with Spark 4.0, so without the guard the five UDF/UDAF/UDTF tests
run without the required class and fail with CANNOT_LOAD_FUNCTION_CLASS.

Drop the hunk so upstream's assume() is preserved. The tests are cancelled
on Spark 4.0 (jar missing) and continue to run on Spark 3.4/3.5 (jar
present), matching upstream behavior.

Closes apache#4049.
@andygrove andygrove changed the title test: re-enable sql_hive-1 for Spark 4.0 to check if #2946 still reproduces test: re-enable sql_hive-1 for Spark 4.0 and fix surfaced failures Apr 23, 2026
@andygrove andygrove changed the title test: re-enable sql_hive-1 for Spark 4.0 and fix surfaced failures test: re-enable sql_hive-1 for Spark 4.0 and fix two small failures Apr 23, 2026
@andygrove andygrove marked this pull request as ready for review April 23, 2026 15:24
@andygrove andygrove marked this pull request as draft April 23, 2026 17:38

The upstream assume(jarPath.exists) runs during class construction
(inside udfTestInfos.foreach, before test() registers a case), so when
hive-test-udfs.jar is absent - as it is on the v4.0.1 release tag - the
TestCanceledException propagates out of <init> and ScalaTest marks the
whole suite as aborted, failing the job. Mix in IgnoreCometSuite so the
five tests are reported as ignored under Comet, and comment out the
constructor-time assume so it no longer aborts the suite.
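The constructor-time failure mode described in this commit can be sketched like this. The suite name, jar path, and `udfTestInfos` contents below are illustrative, assuming ScalaTest's `Assertions.assume` and `AnyFunSuite`:

```scala
import org.scalatest.Assertions.assume
import org.scalatest.funsuite.AnyFunSuite

// Illustrative sketch, not the real HiveUDFDynamicLoadSuite.
class UdfDynamicLoadSketch extends AnyFunSuite {
  private val udfTestInfos = Seq("udf-a", "udf-b")
  private val jarPath = "hive-test-udfs.jar"

  udfTestInfos.foreach { info =>
    // BAD: this line is evaluated while the class body (<init>) runs,
    // before test() has registered any case. If the jar is missing, the
    // TestCanceledException escapes the constructor and ScalaTest marks
    // the whole suite as aborted, failing the job.
    // assume(new java.io.File(jarPath).exists)

    test(s"load $info") {
      // OK: an assume() inside the test body cancels only this test,
      // which is reported as cancelled/ignored rather than aborting
      // the suite.
      assume(new java.io.File(jarPath).exists)
    }
  }
}
```

This is why simply restoring upstream's constructor-time `assume` was not enough: it had to be commented out again, with `IgnoreCometSuite` mixed in so the five tests show up as ignored under Comet.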
@andygrove andygrove marked this pull request as ready for review April 24, 2026 00:19
Contributor

@comphead comphead left a comment

It's LGTM, @parthchandra FYI

-          errorClass = "_LEGACY_ERROR_TEMP_2055",
-          messageParameters = Map("message" -> s"File $path does not exist")))
+        QueryExecutionErrors
+          .fileNotExistError(path, new FileNotFoundException(s"File $path does not exist")))
Contributor


I think Spark 4.0 will use - FAILED_READ_FILE.FILE_NOT_EXIST instead of the legacy error class. Not a blocker (See https://github.com/apache/spark/blob/c241d5ad4a2372bbddc7dd8339987a09f501dc36/sql/catalyst/src/main/scala/org/apache/spark/sql/errors/QueryExecutionErrors.scala#L879)

Member Author


Yes, that is what this code change achieves? Maybe I am not understanding something here though

Contributor


Oh right. The split line threw me off!

@andygrove andygrove merged commit 2cb6142 into apache:main Apr 24, 2026
178 of 179 checks passed

3 participants